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Abstract 

In their 1993 paper, Arak, Clifford and Surgailis discussed a new model 
of random planar graph. As a particular case, that model yields tessel- 
lations with only T- vertices (T-tessellations). Using a similar approach 
involving Poisson lines, a new model of random T-tessellations is proposed. 
Campbell measures, Papangelou kernels and Georgii-Nguyen-Zessin for- 
mulae are translated from point process theory to random T-tessellations. 
It is shown that the new model shows properties similar to the Poisson 
point process and can therefore be considered as a completely random 
T-tessellation. Gibbs variants are introduced leading to models of ran- 
dom T-tessellations where selected features are controlled. Gibbs random 
T-tessellations are expected to better represent observed tessellations. As 
numerical experiments are a key tool for investigating Gibbs models, we 
derive a simulation algorithm of the Metropolis-Hastings-Green family. 
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1 Introduction 

Random tessellations are attractive mathematical objects, from both theoretical 
and practical points of view. The study of the mathematical properties of 
these objects is still leading to open problems, while the range of applications 
covers a broad panel of scientific domains such as astronomy, geophysics, image 
processing or environmental sciences [HI [^. 

When modelling real-world structures, one aims at flexible classes of ran- 
dom models able to represent a wide range of spatial patterns. Gibbsian point 
processes combined with Voronoi' diagrams offer such an attractive approach. 
The class of Gibbs point processes is enriched continuously (hard-core, Strauss, 
area-interaction, Quermass-interaction) . Random tessellations are obtained as 
Voronoi diagrams associated with the germs yielded by such Gibbs processes, see 
e.g. [3]. Hence a large class of models for random tessellations is made available 
for applications. Our aim is to sketch a similar theoretical framework for other 
types of tessellations which are not germ-based. We will focus on T-tessellations 
(tessellations with only T-vertices) which are naturally built using geometrical 
operators (splits, merges and flips) instead of a germ-based procedure. 

Our approach is closely related to the one used by Arak, Clifford and Sur- 
gailis in their paper [T] on a random planar graph model. In particular, a key 
ingredient is the Poisson line process. As a first step, this paper introduces a 
new model of random T-tessellations which can be considered as a completely 
random model. Then Gibbs variations are proposed and a general algorithm for 
simulating them is described. 

Throughout the paper, we focus on the case where the domain of interest 
is bounded. Extension of Gibbs models for T-tessellations of the whole plane 
remains an open problem at this stage. 

Section [2] provides definitions, notations and basic results about T-tessella- 
tions. The completely random T-tessellation model is discussed in Section |3j 
This section also introduces for arbitrary random T-tessellations Campbell mea- 
sures, Papangelou kernels which are widely used in point process theory. Our 
new model can be considered as a T-tessellation analog to the Poisson point pro- 
cess. This claim is based on Georgii-Nguyen-Zessin type formulae. Therefore 
the T-tessellation model introduced in Section [3] is referred to as a completely 
random T-tessellation. Gibbsian extensions are discussed in Section |4] together 
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with some examples. One example is also a particular case of Arak-Clifford- 
Surgailis random graph model when its parameters are chosen in order to yield 
a T-tessellation. Gcorgii-Nguyen-Zessin formulae are provided for hereditary 
Gibbs models. In Section [5] a simulation algorithm is proposed. The design of 
the algorithm follows the general principles of Metropolis-Hastings-Green algo- 
rithms, already widely used for Gibbs point processes. It involves three types 
of local operators: split, merge and flip. Conditions ensuring the convergence 
of the Markov chain to the target distribution are provided. 

Below, as a notational convention, bold letters are used for denoting random 
variables. Detailed proofs of the main results are postponed in appendices. 

2 The space of T-tessellations 

In this paper, we shall consider only bounded tessellations. The domain of 
interest is denoted by D. It is assumed to be compact. For sake of simplicity, it 
is supposed to be also convex and polygonal. Let a{D), 1{D), nc{D) and ny{D) 
be respectively the area, the perimeter length, the numbers of sides and vertices 
oiD. 

A polygonal tessellation of Z? is a finite subdivision of D into polygonal sets 
(called cells) with disjoint interiors. The tessellation vertices are cell vertices. 
Edges are defined as line segments contained in cell edges, ending at vertices 
and containing no other vertex between their ends. The whole set of tessellation 
edges can be partitioned into maximal subsets of aligned and contiguous edges, 
called segments. 

We shall focus on tessellations with only T-vertices. Such tessellations are 
called here T-tessellations. An example of T-tessellation is shown in Figure [l] 

Definition 1 A tessellation vertex is said to be a T-vertex ij it lies at the 
intersection of exactly three edges and if, among those three, two are aligned. 

A polygonal tessellation of D is called a T-tessellation of D if all its vertices, 
except those of D, are T-vertices and if it has no pair of distinct and aligned 
segments. 

The space of T-tessellations of the domain D is denoted by T. For any T- 
tessellation T G T, let C{T) be the set of its cells, V{T) the set of its vertices, 
E{T) the set of its edges and S[T) the set of its segments. The numbers of 
cells, vertices, edges and segments are denoted by nc(T), n^{T), no(T) and 
UsiT). Vertices, edges and segments are said to be internal when they do not 
lie on the boundary of the domain D. Sets and numbers of internal features are 

specified by adding a circle on top of symbols: e.g. V{T) and n^{T) for the set 
and the number of internal vertices. The numbers of vertices, edges and cells 
are closely related to the numbers of segments. The results given below hold 
under the condition that no internal segment ends at a vertex of the domain D. 
Any vertex is either a vertex of D or the end of an internal segment. Since an 
internal segment has 2 ends, 

n^{T) = n^{D) + 2n,{T). (1) 

The sum of vertex degrees in a graph is twice the number of its edges. Vertices 
in a T-tessellation have degree 2 (for a vertex of D) or 3. This observation 
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Figure 1: Example of a T-tessellation with its components. 



combined with equality ([ij yields the following identity 

ne(T) =nvp)+3n,(r). (2) 

Finally, using the Euler formula, one gets 

ne(T) = n,(T) + l. (3) 

A T-tessellation yields a unique line pattern L{T) defined as the set of lines 
supporting its segments. Conversely, for any finite pattern L of lines hitting 
D, there are generally several T-tessellations T such that L{T) — L. This set 
of tessellations is denoted T{L) and its number of elements nt(L). Kahn \TU[ 
Theorem 3.4] proved the following inequality 



k 



k — k I log k 



where fc is the number of lines in L, e is an arbitrary positive real number and C 
a constant only depending on e. In the next sections, inequality Q will be used 
for deriving further results involving so-called stable functionals. The stability 
condition defined below is similar to the one used for point processes (see [S]). 
It ensures that unnormalized densities are integrable and define distributions 
on T (see sections [3] and [3]) . 

Definition 2 A non-negative functional F on T is said to be stable if there 
exists a real constant K such that F(T) < iir"'(^) for any T ^ T. 

Below it is noticed that the whole space T can be visited by means of three 
local and simple operators: splits, merges and flips. 

A split is the division of a T-tessellation cell by a line segment, see figures [2a| 



and 2b The continuous space of all possible splits for a given T-tessellation T 



is denoted by St- 
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(a) A T-tessellation. 





(b) Split: the new (black) (c) Merge: removal of a 
edge is blocked by two ex- (light grey) non-blocking 
isting segments. segment (in the top-left cor- 

ner). 




(d) Flip: two segments are 
modified, one is shortened 
(suppressed edge in light 
grey) while the other one 
is extended (added edge in 
black) . 



Figure 2: Splits, merges and flips are local transformations of T-tessellations. 
Bottom row: transformations of the T-tessellation shown on the top row. 



A merge is the deletion of a segment consisting of a single edge, see fig- 



ures 2a and 2c Such a segment is said to be non-blocking. Other segments 
are called blocking segments. The finite space of all possible merges for a given 
T-tessellation T is denoted by A4t- Note that Mt may be empty. For any T- 
tessellation T, there are Ug^nh (T) possible merges, where rig, nb {T) is the number 
of internal non-blocking segments of T. 

A flip is the deletion of an edge at the end of a blocking segment combined 
with the addition of a new edge. At one of the vertices of the deleted edge, a 
segment is blocked. The new edge extends the blocked segment until the next 



segment, see Figure 2d There are 2ns,b (T) possible flips, where Ug^h (T) is the 
number of internal blocking segments in tessellation T. The finite space of all 
possible flips for a given T-tessellation T is denoted by J^t- 

Note that any split can be reversed by a merge (and vice- versa). Also any 
flip can be reversed by another flip. Using these three types of updates it is 
possible to visit any part of the space T- Below the so-called empty tessellation 
refers to the trivial T-tessellation Td that consists of a single cell extending to 
the whole domain D. 

Proposition 1 Let T be a non-empty T-tessellation. There exists a finite se- 
quence of merges and flips which transforms T into the empty tessellation Tjj . 
Conversely there exists a finite sequence of splits and flips that transforms the 
empty tessellation To into T. 

In order to empty a given T-tessellation, one can start by removing every non- 
blocking segments (merges) . If at some point there is no non-blocking segment 
left, one can make a blocking segment non-blocking by a finite series of flips. 
The reverse series of inverted operations (splits and flips) defines a path from 
the empty tessellation to the considered T-tessellation. 

Corollary 1 Consider any pair of distinct T-tessellations. There exists a finite 
sequence of splits, merges and flips which transforms one T-tessellation into the 
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other one. 

Next sections will involve uniform (probability) measures on the three spaces 
St, Mt and Ft- Let us introduce these measures here. Since the spaces Mt 
and Ft are finite, any probability measure on those spaces is just a finite set 
of probability values. In particular, one may consider uniform distributions on 
the finite spaces AAt and Ft- Defining a probability measure on the continuous 
space St is less obvious. 

Any split of T can be represented by a pair (c, I) where c is a cell of T and 
I is a line splitting c. Therefore St can be identified to the set of pairs (c, I) 
such that I splits c. Consider the counting measure on the set of cells of T. 
The space of planar lines is endowed with the unique (up to a multiplicative 
constant) Haar measure (i.e. invariant under translations and rotations). Here 
we consider the Haar measure scaled such that the mass of the subset of lines 
hitting say the unit square is equal to 4/7r. The space St of splits applicable to 
T is endowed with the product of both measures restricted to the set of pairs 
(c, /) such that I splits c. The infinitesimal volume element for that measure at 
a split S G St is denoted by dS- The measure dS can be written as 

dS = ^ l{cni#0} dl, (5) 
cec(T) 

where 1 is the indicator function and dl denotes the infinitesimal volume element 
of the Haar measure on the space of lines. Using standard results from integral 
geometry (Crofton formula, see e.g. 0), it can be checked that the total mass 
of that measure is equal to 

21{T) - 1{D) 

TT 

where 1{T) is the total edge length of T. Note that 21{T) - 1{D) is the sum 
of cell perimeters. The measure on St defined above can be normalized into a 
probability measure. Below, that probability measure on St is referred to as 
the uniform probability measure on St and a random split S of T is said to be 
uniform if its distribution is the uniform probability measure on St- Picking a 
split with a uniform distribution can be done using a two-step procedure: 

1. Select a cell c of the T-tessellation T with a probability proportional to 
its perimeter. 

2. Select the line splitting c according to the uniform and isotropic probability 
measure on the set of lines hitting c. 



3 A completely random T-tessellation 

Let us start with a formal definition of a random T-tessellation. The space T 
is equipped with the standard hitting cr-algebra cr(T) (see [TS]) generated by 
events of the form 

where K runs through the set of compact subsets of D. A random T-tessellation 
is a random variable T taking values in [T, cr(T)). 
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Our candidate of completely random T-tessellation is based on the Poisson 
line process, denoted here by L. The process L is supposed to have a unit linear 
intensity (mean length per unit area) . Since only T-tessellations of the bounded 
domain D are considered, only the restriction of L to D, also denoted L for sake 
of simplicity, is relevant. Consider the probability measure /i on cr(T) defined 

by 

^^{A) = Z-'E 1a{T), AeaiT), (6) 

Ter(L) 

where Z is a normalizing constant. In order to prove that fi is well-defined, it 
must be checked that the mean number of T-tessellations in T(L) is finite. The 
latter result is a consequence of the following more general result. 

Theorem 1 Let L be the unit Poisson line process restricted to the bounded 
domain D and let F be a stable functional on T ■ Then 

E ^ F{TY 
Ter(L) 

is finite for any real x. 

Theorem [T] can be proved based of inequality Q as shown in Appendix |A] Ap- 
plying Theorem [T] with _F = 1 proves that the measure /i defined by Equation 
^ is indeed finite. The normalizing constant Z has no known analytical ex- 
pression. As a further consequence of Theorem [l] for T ^ /i and for any stable 
functional F, all moments of i^(T) are finite. 

If T '-^ then given L(T) — L, T is uniformly distributed on the finite set 
T{L). It should be noticed that L{T) is not Poisson. In particular, given the 
number of lines in L{T) , the probability of a configuration of lines is proportional 
to the size of T(L), i.e. to the number of T-tessellations supported by the lines. 
Further insights into the distribution /i can be gained based on Papangelou 
kernels. 

Papangelou kernels were defined for point processes [3T1 [TT] . Heuristically, 
the (first-order) Papangelou kernel is the conditional probability to have a point 
at a given location given the point process outside this location. Formally, the 
Papangelou kernel is defined based on a decomposition of the reduced Campbell 
measure. The latter can be interpreted as the joint distribution of a typical point 
of a point process and the rest of the point process. 

Extending these notions to arbitrary random T-tessellations is not straight- 
forward because tessellation components (vertices, edges) are more geometri- 
cally constrained than isolated points. In T-tessellations, features that can 
easily be added or removed are (non-blocking) segments. Below, segments that 
can be added to a T-tessellation T will be identified to splits and St will de- 
note the space of such segments. Conversely, Aix will denote the (finite) set of 
non-blocking segments that can be removed from T. Let Cs be the space 

C,^{is,T):TeT,s€ST}. 

Definition 3 The split (reduced) Campbell measure of a random T-tessellation 
T is defined by the following identity 

C,((/.)=E '/'KT\{m}), (b:C,^R. (7) 
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The split Papangelou kernel is the kernel Pg characterized by the identity 



C,(0)=E /" (/.(s,T)P,(T,ds), 0:C,^M. (8) 

J St 

Note that for a point process, the reduced CampbeU measure is defined by 
replacing the sum in Equation ([7| by a sum on all points of the process. For a T- 
tessellation, the sum is reduced to segments that are non-blocking. Continuing 
the comparison with point processes, it should be noticed that the Papangelou 
kernel Ps(T,ds) is a measure on a space which depends on the realization T 
while the Papangelou kernel of a planar point process is just a measure on M^. 

A simple analytical expression of the split Papangelou kernel is available for 
the random T-tessellation T ^ /i. Section [2] introduced a uniform measure on 
splits. This measure with infinitesimal element denoted dS induces a measure 
on non-blocking segments that can be added to a T-tessellation. Below an 
infinitesimal element of the latter measure is denoted by ds. 

Proposition 2 The split Papangelou kernel of a random T-tessellation T ^ fi 
can be expressed as 

Ps{T,ds)^ds. (9) 

The proof of that proposition, detailed in Appendix [B} is based on a fundamen- 
tal property of the Campbell measure for Poisson (line) processes. Note that a 
Poisson line process is involved in the definition of the measure fi defined on T, 
see Equation This result about the split Papangelou kernel is to be com- 
pared with the analog result concerning the Papangelou kernel of Poisson point 
processes. A heuristic interpretation of Proposition [2] is that the conditional 
distribution of a non-blocking segment of T ^ /i docs not depend on the rest 
of T (i.e. is uniform). As a direct consequence of Proposition [2| we have the 
following Georgii-Nguyen-Zessin formula. 

Corollary 2 IfT^fi, then the following identity holds 

E 0(to,T\{to}) =E / (/)(s,T)ds, Cs-.^-^U. (10) 

Note that this result can be reformulated in terms of splits and merges. In 



particular, in the left-hand side of (10 1, T \ {m} can be written as MT where 



M is the merge consisting of removing the non-blocking segment m from T. 
Furthermore, the removable segment m is identified to the split M^^. The 
reformulation is as follows: 

E J2 <l> {M-\ MT) = E f (l){S,T)dS. (11) 

J St 



By analogy, a further result can be obtained for flips. The flip Campbell 
measure is defined on the space 

Ci = {{F,T):T ^T,F ^Ft}. 
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Definition 4 The flip Campbell measure of a given random T-tessellation T is 
defined by the following identity 

Cf{<ly)^¥.Y^cj,{F-\FT), c^-.Cf^R. (12) 

Fe^T 

The flip Papangelou kernel is the kernel Pf characterized by the identity 

C/(0)=E ^ 0(^^,T)PXT,F), 0:C/^M. (13) 
FeJ^T 

Again a simple analytical expression of the flip Papangelou kernel can be derived 
for T ~ /i. The derivation of this expression is based on the definition ^ 
involving a flat sum of T-tessellations supported by a line pattern. 

Proposition 3 The flip Papangelou kernel of a random T-tessellation T ^ fi 
can be expressed as 

Pf{T,F)^l. (14) 

A detailed proof is provided in Appendix [C] Again, a heuristic interpretation 
of Proposition [3] is that all flips have the same conditional probability given the 
rest of T. Proposition [3] leads to a Georgii-Nguyen-Zessin formula for flips. 

Corollary 3 IfT^fi, then the following identity holds 

E ^ 0(i^-i,FT) =E ^ 0(F,T), 0:C/->M. (15) 

Hence expressions of Papangelou kernels for splits and flips indicate that a 
random T-tessellation T ^ fi shows minimal spatial dependency, suggesting that 
H can be considered as the distribution of a completely random T-tessellation. 
Below this model of completely random T-tessellation will be referred to as the 
CRTT model. 



4 Gibbsian T-tessellations 

Although the completely random T-tessellation model introduced in the pre- 
vious section shows appealing features, it may not be appropriate for repre- 
senting real life structures which may exhibit some kind of regularity. This 
section is devoted to Gibbsian extensions allowing to control a large spectrum 
of T-tessellation features. Gibbs random T-tessellations are defined as follows. 

Definition 5 Let h be a stable non-negative functional on T. The Gibbs ran- 
dom T-tessellation with unnormalized density h is the random T-tessellation 
with distribution 

P{dT) cx h[T) ^i{dT), (16) 

where the proportionality constant is defined such that P has total mass equal 
to 1. 



Equation ( 16 ) defines a probabilistic measure on T if and only if the unnor- 



malized density h has a non-null finite integral with respect to /i. The latter 
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condition is guaranteed by the assumed stabihty of h. Note that for point pro- 
cesses (in bounded domains) too, the integrabihty of unnormahzed densities 
with respect to the Poisson distribution is guaranted by a stabihty condition, 
see e.g. [6l Section 3.7] or [22l Chapter 3]. Following the Gibbsian terminology, 
— log h is referred to as the energy function. 

Extending results of Section |3] the split and flip Papangelou kernels can be 
derived for Gibbs T-tessellations. As for Gibbs point processes, such results are 
obtained under a kind of hereditary condition. 

Definition 6 A Gibbs random T-tessellation with unnormahzed density h is 
hereditary if it fulfills the following two condititions : 

• For any pair (T, S), T e T and S e St, h{ST) = h{T) ^ 0. 

• For any pair (T, F),T eT and F G J't, h{FT) = ^ h{T) = 0. 

Proposition A If T is a Gibbs random T-tessellation with hereditary unnor- 
mahzed density h, then its split and flip Papangelou kernels can he expressed as 
follows 

hjTUjs}) ^ h{FT) 

h{T) ^^^^'^) = T(rr- 

And the following two Georgii-Nguyen-Zessin formulae hold 



E '/'("i,T\{m}) = E / 0(s,T) 



^(TUM) 
h{T) 



ds, (18) 



h ( FT\ 

E ^ HF-\FT) = E ^ 0(F,T)^J— (19) 
taking 0/0=0. 

Proposition |4] is a straightforward consequence of propositions [2||3] and corollar- 
ies EHS] 

Example 1 A first simple example is obtained with an energy function 

-log/i(r) = -n,(r)logr, (20) 

which depends only on the number of internal segments (i.e. the number of 
supporting lines) . The parameter r may be tuned in order to control the number 



of cells. For the energy function (20), the Papangelou kernels are given by 



Ps{T,ds)=Tds, Pi{T,F) = l. 

Note that the Papangelou kernels do not depend on T. Therefore the model 
defined by the energy (pO| behaves like the completely random T-tessellation 
introduced in Section |3j up to the scaling parameter r. Below this model is 
referred to as the completely random T-tessellation with scaling parameter r. 
A realization of such a model is shown in Figure |3a[ It was generated using the 
Metropolis-Hastings-Green algorithm further described in Section [5] □ 
Example 2 The energy function 

-\ogh{T) = -l{T)+n^{T)\og2-~n,{T)\ogT (21) 

TT 
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yields a model of T-tessellation proposed by Arak et al. [T]. In their paper, 
Arak et al. introduced a general probabilistic model for planar random graphs. 
This model involves 2 scalar and 3 functional parameters. Depending on these 
parameters, the model may yield segment patterns, polyline patterns, polygons 
or T-tessellations. A realization of the Arak-Clifford-Surgailis model is shown 
in Figure |3b[ The parameter r was adjusted in order to yield T-tessellations 
with approximately the same mean number of cells as the CRTT model shown 
in Figure [3a| 



For the energy (21), the normalizing constant (partition function) has a 
simple analytical expression. The parameter r is a scaling parameter. It can 
be checked that the linear intensity of this random T-tessellation is equal to 
T. Furthermore, the model has a Markov spatial property in the sense that the 
conditional distribution of T on a region B given the complement D\B depends 
on the complement only through its intersection with the boundary of B. Also 
Arak et al. provided a one-pass algorithm for simulating their model. Miles and 
Mackisack [T7] have derived the distributions of edge and segment lengths and 
of cell areas. 

The split and flip Papangelou kernels of the Arak-Clifford-Surgailis model 



can be computed from equations (171 and ( 21 ) . In particular, for a splitting seg- 
ment s with no end lying on the boundary of the domain D, the split Papangelou 
kernel has a log-density equal to 

fog ^^^^ = - -/(,s) - 2 log 2 + fog r, 

as TT 

where l{s) is the segment length. Compared to the CRTT model, splits by 
short (non-blocking) segments are favoured. A splitting segment is short if 
the splitted cell is small or if it lies at the periphery of large cells. Therefore 
one may expect ACS tessellations to show more small cells than CRTT. This 
was checked numerically. Large samples of realizations of both the CRTT and 
the ACS model were generated. Cell area distributions were plotted as Lorenz 
curves (fraction of smallest cells versus area fraction of the covered domain) , see 
Figure |4j Plots confirmed that the ACS model tends to yield more small cells 
than the CRTT model. □ 
Example 3 In order to obtain T-tessellations with less dispersed cell areas, 
one may consider the following energy function 

-logh{T) = -n,{T) log T + aa''iT), (22) 

where (T) is the sum of squared cell areas for the tessellation T. The statistic 
a^{T) is minimal for equal size cells. Furthermore note that since a?{T) < 



a{D) , the energy (22) is stable. Therefore, the probabilistic measure (116) is 



well-defined for energy ( 22 ) . A realization of this model is shown in Figure 



3c 



The values of t and a were adjusted in order to produce approximately the 



same mean number of cells as in figures 3a and 3b The Lorenz curve of Figure 



|4]confirmcd that the area penalty in (22 ) yields much less small cells than under 
the CRTT model. □ 
Example 4 In order to control angles between incident segments, one may 
consider the following energy 

-log/i(r) = -n,(r)logr-/3 ^ 0(z;), (23) 

v<£V{T) 
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(a) CRTT. T = 1.9. 



(b) ACS. r = 10.75. 




(c) Area penalty, t = 0.043, (d) Angle penalty, t = 12.1, 

a = 10,000. /3 = 2.5. 

Figure 3: Realizations of four model examples. 



where 4>{v) denotes the smallest angle between two edges incident to v (</>(«) is 
close to 7r/2 if the segments meeting at v are almost perpendicular). Since the 
angle (j){v) is bounded, the energy (23) is stable. A realization of this model is 
shown in Figure |3d| Histograms of angles measured from simululated tessella- 
tions are shown in Figure [4] The angle distribution is much more concentrated 
towards tt/2 than under the CRTT model. □ 



5 A Metropolis-Hastings-Green simulation al- 
gorithm 

In this section, we derive a simulation algorithm for random Gibbs T-tessellations 
This algorithm is a special case of the ubiquitous Metropolis-Hastings-Green al- 
gorithm, see e.g. [Il HI |^. It consists in designing a Markov chain with state 
space T and with invariant distribution the target probability measure P. 

The design of a Metropolis-Hastings-Green algorithm involves two basic 
ingredients: random proposals of updates and rules for accepting or reject- 
ing updates. Here, three types of updates are considered: splits, merges and 
flips. Proposition kernels based on these updates are not absolutely continuous 
with respect to the reference measure fj,. Thus like the birth-death Metropolis- 
Hastings algorithm used to simulate Gibbs point processes, the simulation algo- 
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Figure 4: Statistical comparisons between three Gibbs model examples and the 
CRTT model. Left: distributions of cell areas of the CRTT, ACS and squared 
area penalty models shown as Lorenz curves (x-axis : fraction of smallest cells, 
y-axis : covered area fraction, discontinuous : theoretical curve for cells with 
uniformly distributed areas). Right : distribution of acute angles between inci- 
dent segments of the CRTT and angle penalty model. 



rithm described below is an instance of Green extension of Metropolis-Hastings 
algorithm. Rules for accepting updates are based on Hastings ratios. The com- 
putation of Hastings ratios involves ratios of densities with respect to symmetric 
measures on pairs of T-tessellations which differ by a single update. 

5.1 Symmetric measures on pairs of tessellations 

The space of pairs of T-tessellations of the domain D which differ only by a single 
split, merge or flip is denoted T^f. That space can be decomposed into two 
subspaces (pairs which differ by a split or a merge) and 7^^ (pairs which 
differ by a flip). As a preliminary to the design of the simulation algorithm, 
symmetric measures on and on 7^^ are required. A measure v on say is 
symmetric if for any measurable subset A C 7^, the following equality holds 

The following result is a direct consequence of the Georgii-Nguyen-Zessin 
formulae for the completely random T-tessellation. 

Proposition 5 Let T ^ /i. The measures ^.sm on Tj,-^ and on defined by 
^^UA) = E [ 1a (T, ST) dS + E ^^iT, AfT), A C Tl, (24) 

and 

fifiA)^E ^a{T,FT), Act/, (25) 
FeJ^T 

are symmetric. 

A detailed proof is given in Appendix [D] 
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5.2 The algorithm 

The proposed algorithm proceeds by successive updates. It involves two propo- 
sition kernels. The first kernel is based on splits and merges. It is quite similar 
to births and deaths used for simulating point processes. Consider two non- 
negative functions Ps and Pm defined on T such that Ps + Pm < 1- For any T- 
tessellation T, Ps{T) (resp. Pm{T)) is the probability for considering applying a 
split (resp. merge) to T. E ps{T)+p„,{T) < 1, with probability l-ps(T)-pm(T), 
no update is considered. 

The rules for selecting an update of a given type are determined by two 
families of non- negative functions Qs and The function Qs is defined on the 
set of pairs (T, S) where T e T and S' G St- For any T-tessellation T, qs{T, .) 
is supposed to be a probability density with respect to the uniform measure 
on St defined by Equation Concerning merges, qm{T, .) is just a discrete 
distribution (finite family of probabilities) defined on A4t- 

Consider the following proposition kernel on T: 



Qsm(r,dr) 



p,{T)q,{T,ST)dS if f = 5T, SgSt, 
P^{T)q,^{T,MT) iff = MT, M&Mt- 



Acceptation of an update is based on the following Hastings ratio 

h{f)qUT,T) 

where qsmiT, T) is the density of QsmM with respect to the symmetric measure 
/ism- It can be easily checked that 



qsmiT,dT) 



Ps{T)qs{T, ST) iff ^ ST, Se St, 
pUT)q.a{T,MT) xif = MT, M e Mt- 



Therefore the Hastings ratio for splits and merges can be expressed as 
rrm HUT)pt-.{UT)qt-.{UT,U-') 

where t € {s, m}, s~'^ — m, m~^ = s, C/ G iSt if i = s and U G M.t if t = m. 

Using splits and merges, one would explore only a subspace of T: nested 
T-tessellations. Hence, updates based on flips are required. Let pi = 1—ps —Pm 
be the probability of trying to flip the current tessellation. And, for any T ^ T, 
let qi{T,.) be a finite distribution defined on J-V- The pair (pf,qf) defines a 
proposition kernel Q[. It is easy to check that Qifi is absolutely continuous 
with respect to the symmetric measure fj,{ with density ptqf. Furthermore, the 
Hastings ratio for fiips can be expressed as in Equation (26) where t = f, f~^ = f 
and U G Jt- 

The simulation algorithm requires as inputs: the unnormalized density h of 
the target distribution, an initial T-tessellation of the domain D, the triplet 
(Ps,Pm,Pf) and the triplet (g^, g^, gf)- 

Each iteration consists of the following steps: 

1. The current T-tessellation is T„. 
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2. Draw at random the update type t from {s, m, f} with probabilities Ps{Tn), 

Pm{Tn) and PfiTn). 

3. If there is no update of type t applicable to r„, r„4-i — Tn- 

4. Draw at random an update U of type t according to the distribution 
defined by qt{T„,.). 

5. Compute the Hastings ratio rt(T„, U). 

6. Accept update (r„+i = UT„) with probability 

min{l,rt(T„,C/)}. 

Let T„ be the Markov chain defined by the algorithm above. By construction 
T„ is reversible with respect to the target distribution P. Further properties of 
T„ are discussed in Section [573) 

Example 5 A simple version of the algorithm is obtained by choosing probabil- 
ities Psi Pm a-nd pf not depending on T and by using uniform random proposals 
of updates. Define 

*n(r,M) = , \ (28) 

qf{T,F) = o \ . (29) 
2n.,b (T) 

in order to get uniformly distributed splits, merges and flips. For a split S, the 
Hastings ratio is 

^ fe(5r)p„, 2^(r)-?p) 

Mr) Psn{n.^^^{T) + l-0 

where ^ € {0, 1, 2} is the number of internal non-blocking segments of T incident 
to the ends of the splitting segment. Similary, for a merge the Hastings ratio is 



h(T) p^2{l{T)-l{M))~l{Dy 



where 1{M) is the length of the edge removed by M . For a flip F, the Hastings 
ratio is 

h{FT) n,^i,{T) 



n{T,M) 



where rig.b [FT) — ris.b {T) varies from —2 up to 2. 

The computation of Hastings ratios requires to keep track of the numbers 
of blocking and non-blocking segments, the total length of internal edges. Also, 
one needs to predict the variations of the non-blocking segment number induced 
by a split and of the blocking segment number induced by a flip. Finally one 
must also predict the variation of the unnormalized density h induced by any 
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Figure 5: A realization of the model obtained by combining energy terms in- 
volved in example models |3] and |4] Parameter values: r = 2.0, a = 93,000, 
P = 200. 



of the three updates. □ 

Simulations based on Metropolis-Hastings-Green algorithms are commonly 
monitored based on plots as those shown in Figure |6] The CRTT model and the 
model of Example [3] were considered. Additionally, another model combining 
both penalizations of examples [3] and |4] was also considered. The values of 
parameters a and /3 were chosen in order to produce rather rectangular cells. A 
realization of the latter model is shown in Figure [5] 

As a first monitoring tool, one can follow the time-evolution of the energy. 
A convenient starting initial tessellation to start with is the empty tessellation. 
The empty tessellation may have quite a high energy. Obviously the Markov 
chain should not be sampled before the energy stabilizes at its equilibrium level. 
The time needed for burnin is heavily dependent on the form of the energy 
function. As shown by Figure [6| burnin is very short for the CRTT model: 
less than 1,000 iterations. Burnin is also short for the model with penalty on 
cell area variability (Example [s]), although the initial empty tessellation shows 
a very high energy compared to the energy value at equilibrium. For a model 
with very severe penalties like the one combining penalties of examples |3] and 
[4] (realization shown in Figure [s]) , burnin takes several hundreds of thousands 
iterations. 

Acceptation rates are also quite informative. As expected, the CRTT model 
is easy to simulate with very high acceptation rates. More constrained models 
show lower acceptation rates. In model of Example [3j unbalanced splits gener- 
ating small cells or merges creating large cells are often rejected. For an extreme 
model such as the one combining examples [3] and [4j the acceptation rates drop 
down at rather low values. In such a case, it might be worthwhile to consider 
more state dependent proposals than those of Example [sj For instance, one 
could try to propose more central splits. 

In general, the simulation Markov chain shows time-correlation: two suc- 
cessive T-tessellations differ only locally. Therefore, the Markov chain is com- 
monly subsampled. The sampling period to be used depends on the range of 
time-correlations. As an empirical method for assessing time correlations, one 
may consider the percentage of segments left unchanged at different time lags. 
Plots from Figure [6] show contrasted situations. For the CRTT model, after 
500 iterations, about only 1 % of segments are left unchanged. For the model 
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(a) CRTT 
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(c) Example ^tra 



Figure 6: Plots for monitoring simulations. Top row: time-evolution of the 
energy. Middle row: time- evolution of acceptation rates (splits in black, merges 
in red, flips in green). Bottom row: fraction of common segments between two 
tessellations versus time lag (log-log scale). 
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of Example [3j such an updating rate requires more than 2,000 iterations. The 
last model shows a very poor updating: very long runs are needed in order to 
obtain uncorrelated realizations. 

5.3 Convergence 

By construction, see [8] or [6] for more details, the Markov chain T„ defined in 
Section [5?2| is reversible with respect to the target distribution P (x h/j,. In order 
to get a convergence result, one needs to establish that T„ is also irreducible 
and aperiodic, see e.g. [201 [TB] . 

Proposition 6 Under both conditions given below, the Markov chain T„ is 
irreducible. 

1. For any T-tessellation T that can be merged, there exists a merge M such 
that 

PmiT)q„,{T, M) > and h{MT)p,{MT)q,{MT, M"!) > 0. (30) 

2. For any T-tessellation T that can be flipped and any flip F G J-^, 

h{T)pf{T)qf{T,F)>0. (31) 

The detailed proof given in Appendix |E] follows the same guidelines as the proof 
used for point processes, see e.g. [Bl[7j. 

Note that the first condition in Proposition |6] is similar to the irreducibility 
condition involved in the birth and death Metropolis-Hastings algorithm for the 
simulation of Gibbs point processes. The second condition is somewhat stronger 
than the first one since it must hold for all flips. In particular the density h 
must keep strictly positive. 

For the simple version of the simulation algorithm described in Example [5] 
both conditions of Proposition|6]hold if and only if the target density h is strictly 
positive. 

Proposition 7 // either P}{Td) > or pmiTn) > 0, the Markov chain T„ is 
aperiodic. 

Proof One needs to prove that there exists a T-tessellation T such that the 
conditional probability that T„+i = T given that T„ = T is positive. Consider 
the empty tessellation Tjj. If p{{To) > 0, a flip of To is considered with a 
positive probability. As there is no flip applicable to To, T„+i = T„ = Tjj. 
The same argument holds if Pm(T_D) > 0. □ 

Since the Markov chain T„ is reversible with respect to P oc /i/i and in view 
of propositions |6] and [T] we have the following convergence result: 

Proposition 8 Let P be a distribution on T with unnormalized density h with 
respect to pi. Under the conditions of propositions^ and^ for P-almost any 
T-tessellation T the conditional distribution of the Markov chain T„ given that 
To = T converges to P in total variation. 
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6 Discussion 



The main feature of the completely random T-tessellation introduced in this 
paper is that both split and flip Papangelou kernels have very simple expressions 
showing a kind of lack of spatial dependency. It would be of high interest to 
further investigate this model. Since analytical probabilistic results are available 
for the Arak-Clifford-Surgailis (Example[2]), it is expected that such results could 
also be derived for our model. In particular, the following issues are of interest: 

• Is there a simple expression for the normalizing constant Z involved in 
Equation ^1 

• Does our model share the same Markovian property as the Arak-Clifford- 
Surgailis model? 

• Is it possible to derive an exact non-iterative simulation algorithm similar 
to the one proposed to Arak et al. [T]? 

It should be noticed that the choice of a reference measure for a class of 
Gibbs models is in some way a matter of convention. Since the CRTT model 
introduced in this paper is absolutely continuous with respect to the Arak- 
Clifford-Surgailis, one could use the ACS model as a reference measure instead. 
In particular, such a substitution would have a slight impact on the simulation 
algorithm described in Section [5} only the Hastings ratio should be modified by 
an extra multiplicative term. 

In this paper, we focused on T-tessellations of a bounded domain. We expect 
the completely random T-tessellation model to be extensible to the whole plane. 
Concerning Gibbs variations, extensions to the whole plane must involve some 
extra conditions on the unnormalized density. 

Another model for random T-tessellations is the STIT tessellation introduced 
by Nagel and Weifi |TSJ [T^. We conjecture that the STIT model can be seen 
as a Gibbs T-tessellation. However the derivation of its density with respect 
to the CRTT distribution remains an open problem. Obviously our simulation 
algorithm is not appropriate for simulating a STIT tessellation. Realizations of 
a STIT are obtained by series of splits (starting from the empty tessellation) 
generating nested T-tessellations. Modifying a nested tessellation by a flip may 
yield a non-nested tessellation. Even a modification of our algorithm consisting 
only in splits and merges would be inefficient. Given a nested T-tessellation, the 
only way to modify the first splitting segment (running across the whole domain) 
requires to remove all subsequent segments in reverse order. Fundamentally, 
STIT tessellations and the Gibbs T-tessellations considered here have different 
geometries. 

A simulation algorithm of Metropolis-Hastings-Green type has been devised. 
This algorithm is based on three local modifications: split, merge and flip. Con- 
ditions ensuring the convergence (in total variation) of the simulation Markov 
chain were derived. Also geometric ergodicity of the Markov chain has to be 
investigated. Under geometric ergodicity and extra conditions (e.g. Lyapunov 
condition), central limit theorems can be obtained for averages of functionals 
based on Monte-Carlo samples of the Markov chain T„. In particular, geometric 
ergodicity has been proved for the Metropolis-Hastings-Green algorithm devised 
for simulating Gibbs point processes, see [71 1^ . We tried to follow a similar ap- 
proach without success. The main difficulty is related to the fact that, when 
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one tries to empty a given T-tessellation, the number of operations (merges and 
flips) involved is not easily bounded. 

The pioneering work of Arak, Clifford and Surgailis gave rise to develop- 
ments by Schreiber, Kluszczynski and van Lieshout [23l [T2j [24] on polygonal 
Markov fields. Although the geometry of those polygonal fields differs from T- 
tessellations, it is of interest to notice that the simulation procedures proposed 
by these authors involve non-local updates based on so-called disagreement 
loops. It seems worthwhile to investigate whether similar non-local updates 
could be adapted for T-tessellations. 

As mentioned in Example |2j the Arak-Clifford-Surgailis model is able to 
produce planar graphs not only with T-vertices but also other types of vertices 
(X, I, L and Y). It remains an open problem how to extend the framework 
discussed here to a more general graph. Obviously, other operators than splits, 
merges and flips (as defined here) are required. 

A practical important issue is the inference of the model parameters. Since 
the likelihood is known up to an untractable normalizing constant and sim- 
ulations can be done by a Metropolis-Hastings-Green algorithm, Monte-Carlo 
maximum likelihood [5] can be considered. Another approach would consist into 
deriving a kind of pseudolikelihood similar to the one developped for point pro- 
cesses, see [9l[3]. As a starting point, one could use the Georgii-Nguyen-Zessin 
formulae for Gibbs T-tessellations derived in Proposition |4j 
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A Proof of Theorem [T] 

Let T ^ /i and F be a stable functional on T. There exists a real constant K 
such that 

TeTii^) Ter(L) 

Note that the number of internal segments in T is equal to the number k of 
lines in L. It follows 

E J2 F{Tf <Ent{L)K''^. 
Ter(L) 

Using the upper bound provided by Equation Q, one gets 

/ 1^ s k-k/logk 

E Y FiTr<c,+EI{\.>2}C^(-——) 
Ter(L) \\ & ) / 

The constant ci is related to the sum terms when the number of lines k is less 
than 2. As F is finite, the constant Ci is also finite. Furthermore, since L is a 
Poisson line process with intensity 1, k is Poisson distributed with parameter 
1{D)/tt. Let C2 be the normalizing constant of this Poisson distribution. We 
have 



Ter(L) k>2 

< C1+C2Y 



k>2 



CK^l(D)^ ^ ^ ^fe-fc/iogfc 



tt/c / \(logfc)i~ 

/ / \ — ^ / log k 



^ ^ / cK^'iiD) y / k y 
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Now consider e — 1/2. The term 



,(logfc)i/2_ 

is upper bounded. Finally it is easy to check that the series 



E 



k>2 

is finite. 



7r(log ky/ 



B Proof of Proposition [2] 

Let T ^ /i where /i is the distribution on T defined by Equation ([6]). In view of 
Definition |3] and Equation ([6]), the split Campbell measure of T can be written 
as 

TeT{L) rrieM-T 

The sum on non-blocking segments can be replaced by a sum over all lines I in 
L combined with an indicator function. Below let s(?,T) be the segment of T 
lying on the line Z of L. 

Cs(0)=E ^ l{^(i.T) is non-blocking} {s{l, T), T \ {s{l, T)}) . 

Ter(L) /GL 

For a given line configuration L and a given line / S L, consider the map 
T^f = T\s{l, T). If s{l, T) is non-blocking, T\s{l, T) is still a T-tessellation. 
Also note that T G T{L \ I). Furthermore, this map is not injective. Given a 
f eT{L\l), the subset of tessellations T such that f = T\ s{l, T) is obtained 
by running through all cells c of T hitting the line I and by splitting T by the 
line segment I Pi c. Hence we have 

l{s(i,T) is non-bloclcing}0 T),T\ {s{l, T)}) 

Ter(L) 

f er(L\o cec(f ) 

It follows 

/eL Ter(L\/) cec(T) 
The right-hand side can be rewritten as 

E^^G,L\0. 

Since L is a Poisson process on the space of lines, its Campbell measure has a 
simple decomposition with a Papangclou kernel equal to its intensity: 



E ^ ?A L \ /) = E f V L) dl. 
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Hence the split Campbell measure for T can be written as 

TGr(L)ceC(T) 

In view of Equation ^ , the right-hand side can be rewritten as 



cec(T 



Finally, using the measure dS on St defined by Equation ([5]), the right-hand 
side is expressed as 



=E / 0(5,T) dS. 

J 



Proposition [2] follows from the definition of the split Papangelou kernel. 

C Proof of Proposition [3] 

Let T ^ /i where /i is the distribution on T defined by Equation ([6]). In view 
of Definition [4| the flip Campbell measure of T is defined by 

TGr(L) FeJ^T 

The double sum can be rewritten using the change of variables 

(F, T)i^ (P ^ F~^,f = FT^ . 

Note that a flip does not change the lines supporting a T-tessellation: L(T) — 
L{f), i.e. T e TiL) ^ f € TiL). Furthermore, the flip F can be applied to 
FT ~ T: F E Ff. Hence the flip Campbell measure can be expressed as follows 

Ter(L) FeJ^f FeT-T 
Therefore the flip Papangelou kernel equals 1 for any pair {T, F) . 

D Proof of Proposition [5] 

For a given A C T^^, let 

A ^ |(T,f') : (f ,r) e .4| . 
It must be shown that /ism(^) = Msm(^)- In view of Equation (24|, 
fisAA)=Ef lAiST,T)dS + E J2 U{MT,T). 
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Consider the first term of tfie right-hand side and let (j>{S, T) = 1a{ST, T). Next 
use identity (fTTl). It can be checked that (t){M-'^,MT) = 1a{T, MT). Therefore 



E 



Now rewrite the second term of the right-hand side as 

E ^ 1a(MT,T)=E ^ 1a(MT,M~^MT). 



MeMi 



Use identity ([TT]) with (l){M~\MT) = 1a{MT, M-'^MT). It can be checked 
that (j){S, T) = 1^(7", ST). Therefore the second term of the right-hand side is 
equal to 

E I 1a(T,S'T) dS. 

J St 

Msm(i)=E 1a(T,MT)+e/' lA{T,ST)dS. 

nf^iVyf 'J St 



Hence, 



Compare with (24). 



E Proof of Proposition [6] 



Below it is proved that under the conditions of Proposition[6j T„ is i/j-irreducible 
where il) is the measure on T defined by ^'{B) = 1 if i? contains the empty 
tessellation Tq and V'(-S) = otherwise. One needs to prove that for any 
T-tessellation T there exists an integer m > 1 such that 

P(T™ =T|To -Ti,) >0. (32) 

First, let us consider the case where T ^ Tp. In view of Proposition [T] there 
exists a sequence (T'„)„=o,...,m of T-tessellations such that r„ = T, Tq = To 
and Tn+i = UnTn where C/„ is either a merge or a flip for any n = 0,...,m— 1. 



Note that whenever C/„ is a merge, it can be assumed to check condition (30 1 



The probability ( 32 ) is greater than or equal to the probability that 

i^n)n—Q,....m — (^n)n— 0,...,m- 

Since T„ is Markovian, the probability of the event above is equal to the product 
of conditional probabilities 

P(T„+i = C/„r„|T„ = T„) =pt„(T„)gt„(T„, ;7„)min{l,rt„(T„, {/„)}, 

where t„ — m if C/„ is a merge and t„ = f if [/„ is a flip. If [/„ is a merge, 



condition ( 30 ) implies that 



Pm(T'„)gm(T'„, f/„) > 0, 
rm(T„,f/„) > 0. 



If Un is a flip, condition (31 1 for T = T„ and F — Un implies that 

PiiTn)qiiT„, Un) > 0, 
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while condition (31) for T = UnTn and F = U,^^ implies that 

rf(r„,C/„) >0. 



Therefore there exists a m > 1 such that (32) holds for any T-tessellation 
T^Td. 

Now let us consider the case where T = Tq = T^. If either PraiTo) > 
or pf{Ti:,) > 0, a merge or a flip of Tq is considered with a probability greater 
than zero. However no such update is applicable to To and Ti = To = To 
almost surely. Therefore the probability that Ti — To is not zero. If both 
PmiTo) — PiiTo) = 0, a split of Tq is considered. If rejected, Ti = To, 
otherwise Ti is a tessellation with 2 cells and as shown above there exists a 
finite integer m such that the probability that the Markov chain reaches the 
empty tessellation Tq starting from Ti is positive. 
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